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ABSTRACT 

There are three basic styles of prettyprinting in current usage. I present a simple 
hypothesis that governs which style is commonly found for any given language, based on 
the observation of several programming languages. 

V 

1 . Background 

Like most programmers, I have a preference for a particular style of prettyprinting. 
Unfortunately, some of my colleagues have adopted what seemed (to me) a very strange 
style. I thought I could make them see the "error of their ways" if I could show them that no 
one uses their style of prettyprinting. However, I found that quite a number of programmers, 
and in fact whole language communities, seem to use their particular style of prettyprinting. 
In the course of this small bit of research, I discovered some simple rules that seem to 
govern what is acceptable prettyprinting style for a programming language. 

I have come to view this diversity of prettyprinting styles as a linguistic phenomenon, and 
thus I can make no (objective) arguments as to why one style should be better than another. 
Of course, I have my own (subjective) reasons for preferring my favorite style, but they are 
not of interest here. 


2. The Styles 

There are three styles commonly found for matching a closing delimiter (for instance end 
or ')’) with an opening delimiter. Other styles are possible, but seldom if ever used. 

2.1 . Ada Style 

The Ada reference manual [10] uses and encourages the following prettyprinting style, 
keyw 
body 

end keyw 

That is, the end keyw keyword is aligned directly under the keyword, and the body is 
indented. The amount of indentation of the body is not important, as long as it is definitely 
indented. 
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2.2. PL/I Style 

PL/I [3, 9] programs are often (but not always) prettyprinted with the following style, 
keyw 
body 

end 

Again the body is indented, but note that the end keyword is aligned with the body, and not 
with the keyword (often do) that begins the structured statement. 

2.3. Lisp Style 

Lisp [19] prettyprinters use the following style. 

(key 

body) 

That is, the closing parenthesis is not given a separate line, but is placed on the end of the 
last line of the body that it closes. 

2.4. Other Possible Styles 

If one thinks about it, there is no a priori reason to rule out other possible styles. For 
instance, indenting the closing keyword even farther than the body, 
keyw 
body 

end 

Or indenting the closing keyword halfway between the indentation of the body and the 
indentation of the opening keyword. 

keyw 

body 

end 

Or not indenting the body or the closing keyword at all. 
keyw 
body 
end 

Note that all the styles commonly in use for languages with structured statements indent the 
body of a structured construct. Thus one may surmise that one will never see a 
prettyprinting style that does not indent the bodies of structured statements. (Languages 
like BASIC [4] and SNOBOL [13] do not have structured statements to indent.) 

These alternative styles seem rarely, if ever to be used for languages with structured 
constructs. Probably they make visual pattern recognition of nesting levels difficult. 
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When I first started to look at prettyprinting styles, I decided to survey whatever books I 
had that contained examples of formatted programs. Because I have access to quite a few 
programming language manuals and references, I was able to sample a large number of 
languages. It was during this process that I started to observe the three classes of 
prettyprinting styles shown above. Part of my survey apears below. 


Ada Style 

Ada [10] 

Pascal [15] 

Edison [7] 

Modula-2 [23] 

Algol 68 [21] 
Concurrent Pascal [6] 


PL/I Style 

PL/I [9] 
CSP/k [14] 
CLU [18] 


LISP Style 

LISP [22] 
BCPL [20] 
ML [12] 
COBOL [2] 
Prolog [8] 
MDL [11] 


One should not take the sheer number of languages in any one category as some 

indication of "goodness". 

Based on this data I made the following hypotheses. 

1 • If a language has a uniform, and small (typically single character) closing 
keyword which is also a punctuation symbol (such as ’)’ in LISP or in COBOL 
and Prolog), then programmers tends to use LISP style prettyprinting. 

2. If a language has a uniform, but rather larger or more visually arresting closing 
keyword (such as end in PL/I and CLU), then PL/I style is preferred. 

3. If a language has several distinct closing keywords (such as esac and fi in Algol 
68) or "end x" combinations (such as end if in Ada), then Ada style is preferred. 

These hypotheses work fairly well in some cases. For instance, the language VAL [1] has 
a syntax similar to CLU’s except that it has separate "end_x" keywords for all its structured 
statements. The theory predicts prettyprinters will use Ada style for VAL, and indeed this is 
the case in [1], Similarly, all LISP variants and SmallTalk (which uses ’]’ in some statements) 
tend to be prettyprinted using LISP style. 

A striking example of the theory’s correct prediction occurs for Russell [5]. This language 
has both differentiated end keywords (such as in ... ni) and a common terminator used in 
may cases (’{’...’}’). The hypotheses predict that ni is lined up with in as in Ada style, and ’}’ 

ls simply tacked on the end. This is in fact the case in several Russell programs that I have 
seen in [5]. 

However, there are some problems with the theory. One interesting counter-example is 
ne ’C' language. Kerhenihagn and Ritche [16] format using the Ada style, and I have seen 
several programmers use PL/I style. C has a short, universal delimiter for its structured 
statements, namely '}', so the theory would predict that it would be prettyprinted using LISP 
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Nil 


ifiw 


style; but this is almost never the case. 


Another problem is that some authors seem to advocate the same style for every language. 
For instance, Ledgard [1 7] advocates Ada style for PL/I code. 


Perhaps there are some factors that the theory does not take into account, such as the 
legacy of the language (which languages it resembles or was based on), or the personal 
preferences of influential language designers or authors. 
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4. Conclusions 


There are three basic prettyprinting styles used for programming languages. Some 
regular patterns seem to emerge which allow one to predict from the syntax of the language, 
how one will commonly see it indented. It is thus quite impossible to label one prettyprinting 
style as "better" than another. 
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